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ABSTRACT 

\ 

In  this  paper,  three  special  pursuit  games  on  cyclic  graphs  are  solved- 
These  games  are  relevant  to  unsolved  problems  initiated  by  a  game  without  a  value  ^ 
presented  at  the  International  Woricshop  on  Game  Theory  held  at  Cornell  University 
in  1978.  ,  ^ 

We  propose  thereto  the  following  problems  :  ' 

(i)  For  a  finite  or  infinite  game  without  a  saddle  point,  how  should  the 
players  make  their  decisions?  Here,  we  suppose  that  the  game  is  played  once. 
Thus  mixed  strategies  are  not  considered. 

(ii)  For  an  infinite  game  without  a  value  v,  i.e.,  vi<V2,  how  should  the 
players  make  their  decisions? 

To  answer  these  questions,  new  concepts  of  "quasi-equilibrium"  and 
"pseudo-equilibrium"  are  defined  and  it  is  shown  that  a  game  MDCPG  has  a  quasi¬ 
equilibrium  pair. 
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QUASI-EQUILIBRIUM  PAIRS  IN  PURSUIT  GAMES 
ON  A  CYCLIC  GRAPH:  SOME  MODIFIED  CASES 

by 

A .  Chames  and  D.  Zhang 


INTRODUCTION 

The  value  of  game  theory  applied  to  problems  of  pursuit,  search  and  ambush 
has  been  recognized  since  the  Second  World  War.  Many  of  the  games  which  arise  in 
such  situations  are  of  the  type  called  "geometric  games"  by  W.H.Ruckle  [R(W)1]. 
A  geometric  game  is  an  example  of  a  two-person  zero-sum  game  (see  section  2.0). 
Namely,  it  is  a  game  having  the  following  form:  Two  antagonists,  known  hereafter 
as  Red  and  Blue,  choose  subsets  r  and  b  respectively  of  a  set  S.  Blue  then  receives 
from  Red  a  payoff  which  is  a  function  of  the  triple  r,b  and  mb.  In  general.  Red  and 
Blue  may  not  choose  any  subset  of  S,  but  rather  Red  must  select  from  a  collection  R 
of  admissible  subsets  (pure  strategies)  for  Red  and  Blue  from  a  collection  B. 
Usually,  the  collections  R,  B  and  the  payoff  are  determined  within  a  geometric 
structure  on  S.  The  term  "geometric"  is  used  in  a  wide  sense  and  includes  finite 
structures  on  S.  A  geometric  game  is  described  by  specifying  S,  R,B  and  the 
payoff. 

Geometric  games  may  be  divided  into  three  categories:  (a)  finite  games,  for 
which  the  set  S  is  finite,  (b)  continuous  games,  for  which  the  set  S  is  a  line 
segment,  circle  or  region  in  Euclidean  space  of  two  or  more  dimensions  and  (c)  the 
case  in  which  S  is  a  countable  set,  such  as  the  countable  product  of  finite  graphs.  A 
pursuit  game  over  a  discrete  graph  belongs  to  category  (c). 

In  section  1  of  this  paper,  games  and  actions  will  be  described  either  in 
subset  terms  or  in  terms  of  functions. 


Suppose  G  is  a  graph,  N  a  set  of  positive  integers.  Let  B  as  well  as  R  consist 
of  functions  p  from  N  into  nodes  of  G  such  that  p(i+l)  is  connected  to  p(i)  by  an 
edge  of  G  (i.e.,  is  "adjacent")  in  the  nondirected  graph  case  and  at  the  end  of  a 
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directed  edge  leading  from  p(i)  in  the  directed  case.  If  Red  chooses  the  pure  strategy 
p  in  R  and  Blue  chooses  the  pure  strategy  j)  in  B,  the  payoff  to  Blue  is  the  smallest 
integer  m  such  that  p(m)=j3(m),  or  <»  if  p  (k)?tP(k),  for  all  k  in  N.  The  study  of 
these  games  leads  to  mathematical  difficulties  which  have  not  yet  been  mastered. 
[R(W)2]  gave  solutions  to  some  such  problems. 

(i)  PURSUIT  ON  A  COMPLETE  GRAPH  (PCG) 

A  graph  of  n  points  is  called  complete  if  every  pair  of  vertices  in  the  graph  G 
are  adjacent  (connected  by  an  arc).  The  value  of  the  PCG  to  Blue  for  complete 
graphs  is  n.  An  optimal  strategy  for  both  Red  and  Blue  is  to  choose  the  kth  point 
with  independent  probability  1/n  from  among  all  n  points  of  G. 

(ii)  PURSUIT  WITH  COMPARTMENTS  (PC) 

Consider  a  pursuit  game  on  a  graph  G  in  which  Red,  the  pursuer,  and  Blue, 
the  pursued,  have  different  capabilities.  As  far  as  Red  is  concerned,  the  graph  is 
complete.  On  the  other  hand,  Blue's  freedom  of  motion  is  more  restricted.  From 
Blue’s  point  of  view,  the  Graph  G  is  the  union  of  m  disjoint  complete  graphs. 

G  =GiuG2U...LXjin 

such  that  Gi  contains  ki  points.  This  means  that  at  time  1  Blue  may  choose  any  point 
of  G,  but  if  he  chooses  his  initial  position  in  Gi  he  must  thereafter  remain  in  Gf.  This 
game  is  called  Pursuit  with  Compartments  (PC). 

When  kj  =k  for  each  i  in  the  PC,  the  value  of  the  game  is  (m+l)/2+(k-l)m. 
An  optimal  strategy  for  Blue  is  to  choose  at  time  1  any  vertex  in  G  with  probability 
1/n  and  thereafter  choose  with  probability  1/k  one  of  the  k  vertices  in  the  subgraph  G, 
in  which  he  finds  himself.  An  optimal  strategy  for  Red  is  the  following:  (a)  at  times 
gm+1,  g=0,l,2,...  visit  any  vertex  with  probability  1/n;  (b)  at  times  qm+r, 
q=0, 1,2,..., where  l<r^m  visit  a  vertex  in  a  subgraph  not  visited  from  time  qm+1 
through  time  qm+(r-l)  with  probability  l/(n-(r-l)k). 

The  approximate  value  of  the  general  PC  is 

oo 

X  Tq, 

0 


where 
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rq=exP7fi - ^ 

y _ I - 

log(l-l/kj) 

A  rule  of  thumb  strategy  for  Red  is  any  pure  strategy  f  for  which 
Cq(i)=  number  of  integers  for  which  f(j)=q- 

It  has  the  value 

lAog(l-l/ki) 

m 

X  l/log(l-l/kj) 
j=i 

or 

lAog(l-l/ki)  ^  ^ 

m 

X  l/log(l-l/kj) 
j=i 

A  rule  of  thumb  strategy  for  Blue  is  to  initially  hide  at  any  vertex  with  probability  1/n 
and  thereafter  to  move  to  any  vertex  in  the  subgraph  Gi  in  which  he  finds  himself 
with  probability  1/ki. 

(iii)  PURSUIT  ON  A  DIRECTED  LINEAR  GRAPH  (PDL) 

The  value  of  the  PDL  (game)  to  Blue  is  n.  An  e-optimal  strategy  for  Blue  (see 
section  1.0)  is  to  choose  the  initial  point  xi  with  probability  e/(n-l)  and  remain  there 
or  choose  the  final  point  Xn  with  probability  l-e/(n-l)  and  (of  necessity)  remain  there. 
An  optimal  strategy  for  Red  is  to  proceed  directly  fiom  xi  to  Xn. 

(iv)  PURSUIT  ON  A  CYCLIC  GRAPH  WHEN  THE  PLAYERS  ARE  RESTRICTED 
TO  SYMMETRIC  AND  STOCHASTIC  STRATEGIES  (MCPG) 

Solution  of  the  pursuit  game  on  a  cyclic  graph  with  n  vertices,  CPG,  was 
described  as  an  unsolved  problem  at  the  International  Workshop  on  Game  Theory 
held  at  Cornell  University  in  1978.  Dr.  Ruckle  found  the  solution  of  the  CPG  in  the 
special  case  when  Blue  and  Red  both  are  limited  to  symmetric  Markovian  strategies 
(MCPG)  in  1981  [R(W)1].  That  is,  at  the  beginning  of  the  game  Blue  chooses  a 
number  p  and  Red  chooses  a  number  q  between  0  and  1/2.  At  each  turn.  Blue  moves 
left  with  probability  p,  right  with  probability  p,  and  remains  in  place  with  probability 
l-2p  while  Red  exhibits  the  same  behavior  with  q  replacing  p.  Assume  n>4,since 
when  n= 1,2,3  the  CPG  coincides  with  the  PCG  which  are  solved. 
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Let  qo  be  the  number  between  0  and  1/2  for  which  f(0,qo)=f(l/2,  qo),  where 
the  value  of  the  MCPG  is  f(0,qo).  An  optimal  strategy  for  Red,  the  pursuer,  is  the 
pure  strategy  qo-  An  optimal  strategy  for  Blue,  the  evader,  is  to  choose  p=0  with 
probability  to  and  1/2  with  probability  1-to  where 

-  (df(l/2,q)/dq)q=q, 

^  (df(0,q)/dq)q=q,-(df(l/2,q)/dq)q=q,- 

If  the  Markovian  strategies  are  not  symmetric,  the  arguments  in  [R(W)1]  no 
longer  hold.  The  situation  becomes  much  more  complicated[CZl].  Here,in  section  1 
we  discuss  the  pursuit  game  on  a  discrete  cyclic  graph  of  n  vertices  [R(W)2]  when 
Blue  and  Red  are  restricted  to  stochastic  strategies. 

Also  we  present  here  in  section  1  the  solution  of  CPG  when  its  payoff 
function  is  modified,  i.e.,  on  replacing  the  payoff  to  Blue 

min  i:  r(i)=b(i),  i=l,2,3,... 

oo,  r(i)9tb(i),  i=l,2,3,... 

by  the  payoff  to  Blue 

(r  b)=(  i=l,2,3,... 

l(oroo),  r(i);tb(i),  i=  1,2,3,... 

We  also  obtain  similar  results  for  solution  of  the  pursuit  game  on  a  directed 
cyclic  graph  with  n  vertices,  DCPG  [R(W)2]. 

As  will  be  seen,these  modifications  are  reasonable  and  interesting.  The  result 
obtained  here  could  help  us  in  understanding  the  complexity  of  CPG  itself. 

In  section  2  of  the  paper,  the  equilibrium  concept  is  discussed  and  a  new 
concept  "quasi-equilibrium"  is  proposed.  The  concept  of  optimal  solutions  to  a 
game,  hence,  is  generalized. 
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1.  PURSUIT  ON  A  CYCUC  GRAPH  -  SOME  MODIFIED  CASES 


1.0  TWO-PERSON  2ERO-SUM  GAMES  [0(G)  1] 


A  pursuit  game  on  a  cyclic  graph  is  an  example  of  a  two-person  zero-sum 
games.  This  section  describes  the  essentials  of  their  theory  employed  in  this  paper. 

A  two-person  zero-sum  game  is  defined  here  as  a  triple  (B,R,f)  where  B  and 
R  are  sets  and  f  is  a  real  valued  function  defined  on  the  cartesian  product  BxR.  The 
set  B  is  called  the  set  of  admissible  pure  strategies  for  Blue  and  the  set  R  is  called  the 
set  of  admissible  pure  strategies  for  Red.  The  function  f  is  called  the  payoff  function. 
The  underlying  ideas  in  describing  a  two  person  game  in  this  way  are:  (a)  if  Blue 
chooses  b  from  B  and  Red  chooses  r  firom  R  then  Blue  receives  f(b,r)  fi-om  Red  or 
equivalently  Blue  pays  -ffb^")  to  Red;  (b)  Blue  knows  exactly  what  is  in  R  and  Red 
knows  exactly  what  is  in  B,  but  neither  have  any  further  knowledge  of  what  element 
their  opponent  will  choose;  (c)  both  Blue  and  Red  know  f. 

If  both  B  and  R  are  finite  then  (B,R,  f)  is  called  a  finite  game.  Otherwise,  it  is 
called  infinite. 

If  the  following  two  conditions  hold: 

f(bo,ro)^  f(b,ro)  for  all  be  B, 
f(bo,ro)^  f(bo,r)  for  all  re  R, 

Then  the  pair  (bo,ro)e  BxR  is  called  a  saddle  point  of  the  game  (B,R,f).  These  two 


inequalities  are  equivalent  to  the  relation 

min  max 


r.R  b.B 


min 

reR 


(f(b.r)). 


If  a  game  (B,R,f)  has  a  saddle  point  (bo^’o)  the  number  f(bo,ro)  is  called  the  value  of 
the  game  and  the  pure  strategies  bo  and  ro  are  called  optimal  strategies  for  Blue  and 
Red,  respectively.  Not  every  finite  game  has  a  saddle  point. 

A  mixed  strategy  (3  for  Blue  is  a  real  valued  function  defined  on  B  with  the 
following  two  properties 

P(b)^  for  each  be  B, 

S  P(b)=l. 

tea 


In  other  words,  P  is  a  probability  distribution  or  probability  measure  on  B.  A  mixed 
strategy  for  Red  is  similarly  defined  as  a  probability  measure  on  R.  The  expected 
payoff  for  the  pair  of  strategies  (3,p)  is  given  by  the  formula 

f(P.P)=S  S  P(b)p(r)f(b,r). 

bEB  ieR 

A  triple  (Po.PO.v)  is  called  a  solution  for  the  game  (B,R,f)  if 

f(b,po)^v^f(Po>r)  for  each  (b,r)e  BxR, 
where  v  is  a  real  number  and  po.PO  ^e  mixed  strategies  for  Blue  and  Red, 
respectively.  The  value  v  is  called  the  value  of  the  game  and  Po  and  po  the  optimal 
strategies  for  Blue  and  Red  respectively. 

Minimax  Theorem  Every  finite  two-person  zero-sum  game  has  a  solution  in 
the  sense  of  mixed  strategies. 

However,  the  Minimax  Theorem  is  not  valid  for  infinite  games.  In  that  case, 
we  can  define 

“f(P,p). 


sup 

If  vi=v2  and  supinf  and  infsup  can  be  replaced  by  maxmin  and  minmax, 
respectively,  optimal  mixed  strategies  will  exist;  If  these  can  be  found,  the  game  is  as 
well  determined  as  the  finite  games  are.  If  only  vi=V2,  the  game  has  a  value 
v(vi=V2)  but  may  have  no  optimal  strategies.  It  will  have  an  "e-optimal  strategies". 
however,  if  given  any  e>0,there  exist  mixed  strategies,  P  and  p  for  Blue  and  Red, 
respectively,  such  that 

f(P4-)>v-e  and  f(b,p  )<v+e 

for  any  (b,r)€  BxR  Thus,  although  such  games  are  not  as  well  determined  as  the 
finite  games,  they  do  seem  to  show  a  type  of  stability  in  their  e-optimal  solution 
strategies. 

The  solution  of  a  game  in  terms  of  mixed  strategies  was  defined  from  an 
entirely  formal  point  of  view.  Besides  the  pure  mathematical  interest,  the 
fundamental  treatise  of  Von  Neumann  and  Morgenstem  [VMl]  presents  an  eloquent 


7 


case  in  favor  of  the  optimal  strategy  concept.  From  a  practical  viewpoint,  such 
optimal  strategies  have  been  observed  to  develop  (a)  in  animal  behavior  by 
naturalists;  (b)  in  contrived  children's  games  by  child  psychologists  and  sociologists 
and  (c)  in  repeated  game  playing  on  a  computer  by  computer  scientists. 


1.1.  DCPG  WHEN  THE  PLAYERS  ARE  RESTRICTED  TO  STOCHASTIC 
STRATEGIES 

A  directed  cyclic  graph  with  n  vertices  is  shown  in  Figure  1.  A  hider,  Blue, 
chooses  a  point  on  the  directed  cyclic  graph,  so  does  a  pursuer.  Red.  Blue  and  Red 
both  move  along  the  graph,  i.e.  they  remain  at  their  vertices  or  each  moves  to  one 
adjacent  vertex  pointed  to  by  an  arrow.  The  payoff  to  Blue  equals  the  number  of 
moves  required  by  Red  to  find  (or  catch)  Blue,  or  is  oo  otherwise.  The  collection  B 
of  pure  strategies  for  Blue  consists  of  all  functions  P  from  the  set  {1,2,...}  into 
the  nodes  of  the  directed  cyclic  graph  such  that  P(i  +1)  =  P(i)  or  P(i)  +  1  (1  if  P(i)  = 
n).  The  collection  R  of  pure  strategies  for  Red  is  the  same,  i.e.,  R  =  B.  This  is  the 
pursuit  game  on  a  directed  cyclic  graph  of  n  vertices,  DCPG.  It  is  unsolved 

[R(W)2],  i.e.,it  is  an  open  question  whether  or  not  it  has  a  solution. 

1 


We  discuss  here  the  pursuit  game  on  a  directed  cyclic  graph  of  n  vertices 
when  Players  Blue  and  Red  are  restricted  to  stochastic  strategies.  That  is,  at  the 
beginning  of  the  game  Blue  chooses  a  number  p  and  Red  chooses  a  number  q 
between  0  and  1.  At  each  turn.  Blue  moves  ahead  with  probability  p,  and  remains 
in  place  with  probability  I  -  p  while  Red  exhibits  the  same  behavior  with  q 
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replaring  p.  We  mainly  consider  n  ^  3.  When  n  =  1  or  2,  the  DCPG  coincides  with 
the  PCG  which  is  solved  [R(W)1, 2]. 

In  the  stochastic  DCPG  on  an  n  vertex  graph  a  mixed  strategy  for  Blue  is  a 
probability  distribution  defined  on  the  set  of  all  pairs  (k,  p)  where  k  =  0,  1, . . . ,  or 
n  - 1  and  0  ^  p  !S1.  The  integer  k  represents  the  initial  position  taken  by  Blue.  Let 
Tk  =  k  +  1  for  k  =  0,  1, .  .  . ,  n  -  2  and  T(n  -  1)  =  0.  While  P  is  an  "optimal" 
strategy  for  Blue  (for  which  Blue  gets  at  least  the  gain  floor)  then  Tp,  defined  by 
TP(A)  =  P({(Tk,  p):  (k,  p)  €  A})  is  also  "optimal"  since  T  is  a  renumbering  of  the 

^  n-i  |( 

vertices  which  retains  their  order.  Hence,  the  strategy  Po  *  —  P  is  also  "optimal" 

for  Blue.  The  distribution  Po,  therefore,  has  the  property 

Po({(k.  p):  0^p:S  1))=  1/n 

for  each  fixed  k.  So,  we  may  assume  Blue  and  by  the  same  reasoning  Red  chooses 
his  initial  vertex  with  probability  1/n  from  the  set  of  all  vertices. 

To  solve  the  problem,  let  us  consider  a  random  walk  MDCPG  equivalent  to 
the  above  stochastic  DCPG.  In  the  MDCPG.  a  single  counter  is  placed  with 
probability  1/n  on  one  of  the  vertices  of  an  n-point  directed  cyclic  graph.  The 
counter  indicates  the  distance  (modulo  n)  from  Blue  to  Red  along  the  clockwise 
direction.  If  the  initial  vertex  is  zero.  Blue  receives  one  and  the  game  ends. 
Otherwise,  at  each  stage  Blue  and  Red,  without  knowing  the  position  of  the  counter, 
each  vote  on  whether  it  is  to  move  ahead  or  remain  fixed.  Their  votes  are  combined 
so  that  if  both  Red  and  Blue  decide  upon  one  clockwise  or  both  Red  and  Blue  remain 
in  place  the  counter  remains  fixed.  If  Red  (Blue)  votes  one  clockwise  and  Blue  (Red) 
remains  the  counter  is  increased  (decreased)  one.  The  payoff  to  Blue  is  the  number 
of  stages  until  the  counter  reaches  zero. 

Suppose  at  the  outset  of  MDCPG,  Blue,  the  hider,  chooses  the  strategy  of 
moving  the  counter  (-1)  with  probability  p  and  staying  it  with  probability  1  -  p. 
Red,  the  pursuer,  chooses  a  strategy  of  moving  the  counter  (+1)  with  probability  q 
and  staying  it  counter  with  probability  1  -  q.  Thus  the  counter  moves  in  a  random 
walk  with  absorbing  state  at  zero.  Assume 

aij  =  prob.  {the  counter  moves  to  j  (mod  n)  |  the  counter  is  at  i} 


where 
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i,  j  =  0,  1,  2, . .  . ,  n  -1, 


so  that  A  =  (ay)  is  the  transition  matrix. 


We  have 


aj 


For  n  =  3, 


1. 

0. 


fp(1-q). 

(1-p)(1-q)  +  pq, 
q(1-p). 

lo. 


i-j=0. 

i  =  0;j-1.2 . n-1 

j-i-1  (modn) 
j  =  i?tO 

j  =  i  +  1  (modn),i?tO, 
otherwise. 


A=. 


1  0  0 
p(1-q)  (1-p)(1-q)  +  pq  q(1-p) 

q(1-p)  p(1-q)  (1-p)(1-q)  +  pq 


and  for  n  =  4 


10  0  0 
P(1-q)  (1-p)(1-q)  +  pq  q(1-p)  0 

0  p(1-q)  (1-p)(1-q)  +  pq  q(1 -p) 

q(1-p)  0  pi-q)  (1 -p){1 -q)+pq 


Let  ej  (j  =  0,  1, . . . ,  n  -  1)  denote  the  n  x  1  matrix  having  a  one  in  the  jth 
row  and  zeros  else  wise,  and  let  e  =  eo  +  ei  +  . . .  +  en-i-  The  initial  probability 
distribution  of  the  counter's  location  is  (l/n)eT.  At  time  k  the  distribution  of  the 
counter's  location  is  (l/n)eTAk-l  because  of  the  Chapman-Kolmogorov  equations 
[R(S)1].  The  probability  that  the  counter  is  at  vertex  j  at  time  k  is  (l/n)eTAk-lej. 
The  expected  payoff  to  Blue,  f(p,  q)  when  Blue  adopts  the  parameter  p  and  Red 
adopts  the  parameter  q  is 


where  the  sum  in  the  equation  can  diverge  to  infinity. 


Let  B  =  (bij)  to  be  a  (n  - 1)  x  (n  - 1)  matrix,  where  bij  =  aij  (i,  j  =  1,  2, . . 
n  -  1),  i.e., 

B= 

(1-p)(1-q)+pq  q(1-p)  0 

p(1-q)  (1-p)(1-q)  +  pq  q{1*p) 


p(1-q)  (1-p)(1-q)  +  pq  q(1-p) 

0  p(1-q)  (1-p)(1-q)  +  pq 


and  d  is  the  n-1  column  matrix  consisting  entirely  of  one's.  By  considering 
partitions  of  the  matrices  we  conclude  eTA^*l(e  -  e©)  =  d^B^^'M  for  each  k  =  1, 
2,„. .  From  this  we  get  the  following  conclusion, 

f(p.q)-1+l£d^B''d. 

(2) 

Lemma  1:  In  either  (i)  p  =  0  and  q  0,  (ii)  p  1  and  q  =  1  or  (iii)  p  0 
and  q  ^  1, 

Ul<i 

for  each  eigenvalue  XofB. 

Proof,  (i)  p  =  0  and  q  ^  0.  Then  B  has  an  eigenvalue  1  -  q  of  multiplicity 
(n-1)  (which  is  less  than  1  since  0  <  q  ^  1)  since 


B- 


* 


1-q 


N 

1  -q 


(ii)  p^  1  and  q  =  1.  Then  B  has  an  (n-1)  -  multiple  eigenvalue  p  <  1  since 


1 1 


(iii)  p  it  0  andq;t  i. 

Suppose  I X,  I  ^1,  where  X  is  an  eigenvalue  of  B,  and  u  is  an  associated 
eigenvector  such  that  maxi^  i  ^  n-i  I  Ui  I  =1. 

We  have 

n-l 

5^b|jUj=XUi,  1=1.2 . n-1. 

j-i 

Because  A  is  a  stochastic  matrix  and  aio  =  p(l  -  q),  it  follows  that 
p(1-q)+2;b,j-1. 

n-1 

Since  0  <  p  and  q  <  1,  p(l  -  q)  >  0  so  that  ^  ^ .  Hence 
lui  I  ^  IXU1  IlS 

i-1 

If  I  u  k  -1 1  <  1.  then  we  have 

|uKjs|xuk|s2^b|(j|uj|  +|ui,.i|b|,k-i  +2)hkj  <2)bkj  s  1 . 

1-1  i-1  j-k  j-i 

So,  1  uk  I  <1.  By  induction  we  get  |  ui  |  <  1  for  each  i  =  1,  2,  .  .  .  ,  n-1 
contradicting  our  hypothesis  that  maxi  s  i  s  n-l  I  ui  I  =1.  Hence  I X.  |  <1. 

Lemma  2:  When  Blue  chooses  the  parameter  p  and  Red  chooses  the 
parameter  q  in  the  MDCPG,  the  payoff  to  Blue  is 


l(P.q)’ 


p-q-Oorp-q-1, 


1  Ti 

1 +— d  (I-B)  d,  otherwise. 


Proof.  When  p  =  q  =  0  orp=q=l,  B  =  I,  the  unit  matrix.  Using  (2),  we 


get 


f(p.q)-1  + 


In  other  cases,  each  eigenvalue  X  of  B  has  absolute  value  less  than  1  by 
Lemma  1.  Using  the  Jordan  canonical  form  of  B,  we  get  that  S®  converges 

k-O 

absolutely  to  (I  -  B)-l  [G(F)1].  Then  because  of  (2),  the  lemma  holds. 


Q.E.D. 

Proposition  1:  For  the  MDCPG,  or  the  stochastic  pursuit  game  on  a  directed 
cyclic  graph  with  n  vertices,  vi  <  V2  and 


Vi »  Sup  inf  f(P.  q)  -  f  (1/2, 0)  -  n. 
P  q 

V2-infSup  f(p,q)-f(1/2.1/2). 
Q  p 


2 

n  +2 


where  P,  Q  are  mixed  strategies  taken  by  Blue  and  Red,  respectively.  That  is.  Blue 

can  get  a  payoff  which  is  not  less  than  n  if  he  takes  p  =  1/2  as  his  strategy.  And 

2 

n  ■t-2 

Red  can  pay  what  is  not  greater  than  3  if  he  takes  q  =  1/2  as  his  strategy.  This 
game  has  no  equilibrium  value. 

Proof.  First  we  claim  that  the  payoff  function  f(p,  q),  (3),  has  the  following 
properties, 

(i)  f(p,  q)  =  f(q,  p),  f(p,  q)  =  f(l  -q,  1  -  p). 

(ii)  f(0,  q),  f(l,  q),  f(p,  0),  f(p,  1),  all  are  monotone  strictly  as  being  showed 
by  arrows  in  Figure  2. 

(Ui)f(0,0)=f(l,  1)=  oo, 

n  +  1 

f(l,0)  =  f(0,  1)=  2  . 

f(l/2,  0)  =  f(l/2,  1)  =  f(0, 1/2)  =  f(l,  1/2)  =  n, 

2 

n  +2 

f  (1/2,  1/2)=  3  . 

(iv)  f(p,  p)  and  f(p,  1/2)  are  both  strictly  monotone  on  the  intervals 
[0,  1/2]  and  [1/2,  1], 


1  3 


2 

Figure  2 

As  examples,  we  calculate  f(l/2, 1/2)  and  prove  f(p,  p)  is  monotone  on  both 
intervals  [0, 1/2]  and  [1/2, 1],  respectively. 

From  (1),  we  get  (n  - 1)  x  (n  - 1)  matrix  (I  -B)  at  p  =  1/2,  q=  1/2  as  follows. 


So,  the  inverse  of  (I  -  B)(l/2, 1/2)  is 


a-B)i‘,(ii)= 


14 


Hence 


2 

n  +2 


n-1 

n-2 

n-3 

n-4 

111 

n 

n 

n 

n 

n  n  n 

n-2 

2(n-2) 

2(n-3) 

2(n-4) 

2*3  2*2  2*1 

n 

n 

n 

n 

■"  n  n  n 

n  -3 

2(n-3) 

3{n-3) 

3(n-4) 

3*3  3*2  3*1 

n 

n 

n 

n 

n  n  n 

n  -4 

2(n-4) 

3(n-4) 

4(n-4) 

4*3  4*2  4.1 

n 

n 

n 

n 

n  n  n 

1  «3 

2«3 

3»3 

4»3 

{n-3).3  (n-3).2  (n.3).1 

n 

n 

n 

n 

n  n  n 

111 

2«2 

hi 

4*2 

(n-3)*2  (n-2).2  (n-2).1 

n 

n 

n 

n 

n  n  n 

1*1 

hi 

3»1 

4»1 

(n-3).1  n-2  n-1 

n 

n 

n 

n 

n  n  n 

1  +ld^(I-B)’’| 
n 

IstI” 

.1*4 

n 

n-1 

liCn- 

i.1 

n-2 

•i)  +  2£ 

n-O+i) 

I  H 

i.1 

When  p  =  q, 


I-B  = 


2p(1-p) 

-P{1-P) 


-P(1-P) 

2p(1-p)  -p(1*p) 

-p<1-p)  2p(1-p)  -p(1-p) 


-P(1-P)  2p(1-p)  -p(1-p) 

-p(1-p)  2p(1-p) 


,  n-i  n-1 

|l-B|„.i-np  (1-p)  . 


,  •  /  n-2  n-2\ 

(l-B)n.i-(aijP  (1-p) 
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where  ay  is  a  constant  for  given  n,  (i,  j  =  1, 2, . . . ,  n  -  1).  So, 

f(p.p)  =  1+ldV-B)'’d-1+-^. 

n  p(1-p) 

where  c  is  a  constant  for  given  n.  From  f  (1/2, 1/2),  c  ^  0.  Hence 


df(p.p)^  c(2p-1) 

is  strictly  increasing  (decreasing)  on  interval  [1/2, 1]  ([0, 1/2]).  Since 
f(0,  0)  =  f(l,  1)  =  oo  and  f(l/2,  1/2)  <  oo. 

Now,  we  calculate  vi  and  V2.  Suppose  Red  takes  q  =  1/2  as  his  strategy, 

then 

maxf(P,1/2)-f(1/2,1/2). 

p 

For  any  q, 

maxf(P.q)^f(q.q)^J(i/2,l/2) 

P 

So, 

2 

V2”r]2jn  rTwl(P'Q)*Ul/2, 1/2)-^^-^^. 

Then  suppose  Blue  takes  p  =  1/2  as  his  strategy.  Then 
minf(1/2.q)-f{1/2.0)  . 

q 

For  any  p, 

min  f(P.q)s 
q 

So, 

max  min  f(P,  q)- 1(1/2, 0)-n. 

P  q 

Finally,  vi  <  V2  since  V2  -  vi  =  (n  -  2)(n  - 1)  and  ns  3. 

Q.E.D. 


f(p.O)  <({1/2,0),  p>  1/2 
f(p,1)<f(1/2,0),p<1/2 

. 


1.2.  PURSUIT  ON  A  CYCLIC  GRAPH 


Given  a  cyclic  graph  of  n  vertices  as  in  Figure  3.  Both  a  hider.  Blue,  and  a 
pursuer,  Red,  can  at  each  stage  remain  at  a  vertex  i,  move  to  i  +  1  (1  if  i  =  n)  or  i  - 1 
(n  if  i  =  1).  The  payoff  to  Blue  is  -1  if  Red  catches  (or  finds)  Blue,  or  1  (or  «>) 
otherwise.  The  collection  B  of  pure  strategies  for  Blue  consists  of  all  functions  P 
from  the  set  { 1,  2,  3, . . .  }  into  the  cyclic  graph  such  that  P(i  +  1)  =  p(i)  -  1,  P(i)  or 
P(i)  +  1  (mod  n).  The  collection  R  of  pure  strategies  for  Red  is  the  same,  i.e.,  R  = 
B.  This  two-person  zero-sum  game  is  called  a  pursuit  game  with  modified  navoff  on 
a  cyclic  graph  of  n  vertices.  CMPG. 


1 


To  solve  the  problem,  first  let  us  consider  a  problem  equivalent  to  SMPG. 
Take  an  infinite  set  L  having  an  underlying  structure  which  limits  the  B  and  R  as 
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follows.  L  is  a  "semi-infinite  rectangular"  array  of  lattice  points  having  n  rows  and 
countable  columns;  Thus, 

L=  ((i.D;i  =  1.2....;j=1.2 . n).ns3. 

And,  let 

B  =  R  =  (P  :  P  satisfies  some  condition  as  follows}, 
e.g.,  p  is  a  function  from  { 1,  2, .  . .  }  into  {1,  2, .  . .  ,  n  =  0  (mod  n)}  (or  a  path 
on  L)  such  that 

fp(i)-1  or 

P(i  +  1)  =  <P(i)  or  (modn) 

\p(i)  +  1 

The  payoff  to  Blue  is 

(P  =  PlO^PCO.  for  some  i  =  1.2,3.... 

\l  (or  oo),  otherwise, 

where  p,  p  are  pure  strategies  of  Blue  and  Red,  respectively. 

Let  Gc  =  R  =  B.  If  p  €  Gc»  then 

P(i  +1)  =  P(i)  +  P  (i  +  1)  (mod  n), 

where  p(i  +  1)  *  0,  - 1  or  +  1.  Let  p,  oe  Gc.  Define 

(P +o)(l)  =P(1) +  <t(1)  (modn) 

(p  +  a)(i+l)  =  (p  +  o)(i)  +  (p(i  +  1)  +  ^i  +  1))  (mod  n), 

where  (p(i  +  1)  +^i  +  1))  =  -1,  0  or  +1  (mod  3).  Then  we  know  Gc  is  a 
commutative  group  isomorphic  to  the  group 

M  =  Zn  +  Z3  +  Z3  +  Z3  + . . . , 

where  Zn,  Z3  are  cyclic  groups  of  order  n  and  3,  respectively. 

On  the  other  hand,  impose  the  discrete  topology  on  both  Zn  and  Z3.  Then 
regard  M  as  the  product  of  Zn,  Z3,  Z3,  Z3,  ....  Using  the  above  mapping 
(isomorphism)  between  Gc  and  M,  Gc  is  a  discrete  topological  space,  too.  Since 
Zn,  Z3  are  compact,  Gc  is  a  compact  topological  space. 

Suppose  Mzn,.  Mz,  are  Haar  measure  on  groups  Zn  and  Z3,  respectively. 

MznO)-^.  j-0.1,2 . n-l, 

MzaO)-^,  j-0,1. 


That  is 


Then  the  product  of  Mqc  of  Mzn.  Mz3,  Mzs, ...  is  the  Haar  measure  on  Gc  =  M  = 

Zn  +  Z3  +  Z3  + . . . .  That  is 

MGcIWk) - l^.k. 1.2.3..... 

n«3 

where 

[r]k={P:P(i)=r(i).  I^i^k}. 

Let 

Gcr  =  {P  P(i)  =  r(i)  for  some  i  =  1,  2, . . .  }. 

=  {P  :  P(i)  r(i),  i  =  1,  2, . . .  }  =  Gc\Gcr- 

And  let 

(r,  k)  =  {P  :  P(i)  ^  r(i),  i  =  1,  2, . . . ,  k). 

Then 

MGg(5cr)”  Bm  Mcjlr,  k)) 

k^oo 

and 

“oJ(f.K)) - 

n«3 

where  Sk  is  the  number  of  different  heads  cut  down  of  paths  in  (r,  k)  between 
column  k  and  k  +  1. 

Lemma  3:  MGc  (5^  =  0  for  any  re  Gc-  Hence  MGc(Gcr)  =  L 
Proof.  Let  us  look  at  Figure  5.  In  (i)  of  Figure  5,  n  =  4,  r(i)  =  0,  i  =  1,  2,  3, . 
...  All  heads  cut  down  of  paths  in  (r,  6)  are  shown  in  the  figure.  Attach  a  number  to 
a  node  as  follows.  1  to  each  node  in  the  first  column,  but  not  on  r.  At  each  node  from 
column  2  which  is  not  on  r,  the  attached  number  is  the  sum  of  numbers  located  on  the 
left  side  and  connected  up  by  a  line.  The  iteration  process  looks  like  that  of  the 


(i),  Figure  5  (n  =  4) 


70  +  90  +  70 


oco 


20 


21 


i  =  1.2,  3, 


Pascal  or  H.  Yang  Triangle.  Then  Sk  is  the  sum  of  all  numbers  on  the  kth  column. 
For  another  example,  in  (ii)  of  Figure  5,  82  =  1+  2  +  3+1=  7. 

For  given  k,  different  r  corresponds  to  different  Sk.  However,  Sk  of  (i)  for 
n  =  4  is  the  greatest.  For  (i),  figure  5,  suppose  s,  =  a  +  b  +  a.  Then  Sj  +  i  =  (a  +  b) 
+(a  +  b  +  a)  +  (b  +  a)  =  2(a  +  b  +  a)  +  b  =  2si  +  b  ^  2si  +  (l/2)si  =  (5/2)si.  And 
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Si*l/  , 

/4»3'  1  .5 

y  M  ^  ^ 

'4*3 


Therefore, 

MccPScr)"  lim  — “0. 
k— »«o4  •  3 


For  general  n,  the  proof  is  similar. 

Since  MQc  (Gc)  =  1,  we  get 

MGc(Gcr)=l-MGc{Scr)  =1-0=1. 

Q.E.D. 

Proposition  2:  For  a  CMPG,  the  Haar  measure  Mgc  is  an  optimal  strategy  for 
both  Red  and  Blue.  And  the  value  to  Blue  is  -1. 

Proof.  If  Blue  uses  the  Haar  measure  x  as  his  strategy  and  Red  chooses  r, 
then  Blue  can  expect 


((X.  r)  d  Mgc  M  -  (-1)  MOclGcr)  + 1  •  Mgc  (^r)  - 1 
Joc 


by  lemma  3.  Hence,  vi2:-l. 

By  symmetry,  if  Red  uses  the  Haar  measure  he  can  hold  Blue  to  the  expected 
payoff  -1,  i.e.,  v2  ^  -1.  From  -1  S  vi  $  v2  S  -1,  we  get  v  =  vi  =  v2  =  -1. 

Q.E.D. 

Corollary  1:  For  CMPG,  the  probability  that  Blue  escapes  is  zero  if  Red  takes 
Haar  measure  as  his  mixed  strategy. 

Similarly,  we  can  discuss  the  pursuit  eame  on  a  directed  cyclic  graph  of  n 
vertices  whose  navoff  is  modified.  DCMPG.  The  payoff  to  Blue  is  -1  if  Red  catches 
(or  finds)  Blue,  or  1  (or  <»)  otherwise.  Gd  =  R  =  B  consists  of  all  functions  j3  from 
the  set  { 1,  2,  3, . . .}  into  the  directed  cyclic  graph  such  that  P(i  +  1)  =  P(i)  or  P(i)  + 
1  (mod  n). 

Replacing  figure  3,  4,  5,  Gc  above  by  figure  1,  6,  7,  Gd,  separately,  we  get 
proposition  3  below.  Note 

Gd  =  M  =  Zn  +  Z2  +  Z2  +  Z2  +  •  •  • . 
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And 

1 

MGd(Wk)  =  n-2'“\  k=1.2.3 . 

n  =  5  =  0  000000>Q0  OO 

O  •  •  ■ 

O  ■  •  ■ 

P  •  •  • 

O  •  ■  • 

Lemma 4:  MGd{Gdr)  =  Oforany  re  Gd.  And  MGd(Gdr)  =  L 
Proposirion  3:  For  a  DCMPG,  the  Haar  Measure  Mg^  is  an  optimal  strategy 

for  both  Red  and  Blue.  And  the  value  to  Blue  is  -1. 

Corollary  2:  For  DCMPG,  the  probability  that  Blue  escapes  is  zero  if  Red 
takes  Haar  Measure  as  his  mixed  strategy. 

When  we  prove  lemma  4,  we  note  that  r  in  (iii).  Figure  7  corresponds  to  the 
largest  sk  for  given  k.  (See  proof  of  lemma  3.) 


2 

1 


o  o  o  o  o 


o  o  o  o  o  o  o 

Figtire  6  (n  =  5  =  0  (mod  5)) 
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r 

O  ■  ^  -0  '0  0  0  •  •  • 


r(i)  =  constant,  i  =  1 , 2,  3, . 


(i),  Figure  7 
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2.  QUASI-EQUILIBRIUM .  PSEUDO-EQUILIBRIUM 

As  we  know,  if  there  exists  a  saddle  point  (b*,r*)  for  a  game,  then  (b*,r*)  is 
an  equilibrium  pair  of  the  game.  In  the  case 

f(bj*)^f(b*,r*),  f(b*j)^f(b*,r*) 

for  any  pure  strategy  b  of  Blue  and  any  pure  strategy  r  of  Red.  In  other  words,  a  pair 
of  strategies  is  said  to  be  in  equilibrium  if  no  player  has  any  positive  reason  for 
changing  his  strategy,  assuming  that  the  opponent  is  not  going  to  change  strategies. 
If  in  such  a  case,  each  player  knows  what  his  opponent  will  play,  then  he  has  reason 
to  play  the  strategy  which  will  give  such  an  equilibrium  pair.  Thus  the  game  becomes 
very  stable. 

If  there  does  not  exist  a  saddle  point  (b*  j*),  we  consider  mixed  strategies. 
When  the  game  is  finite  there  exists  a  value  v(=vi=v2)  and  mixed  strategy  P*  for 
Blue  and  mixed  strategy  p*  for  Red,  respectively, which  yield  this  value.  The  pair 
(P*>p*)  is  said  to  be  in  equilibrium,  or  an  equilibrium  pair  of  the  game.  In  this  case, 

f(p,p*)Sf(p*,p*),  f(p*,p)^  f(P*,p*) 

for  any  mixed  strategy  p  for  Blue  and  any  mixed  strategy  p  for  Red.  When  the  game 
is  infinite  and 

vi=  S“P  irff(|3,p)=“f  f(p,p)=v,. 

P  P  P  P 

then  either  there  exists  (P*,p*)  such  that 

f(p,p*)$  f(p,p)  for  any  p 

and 

f(P*,p)S  f(p,p)  for  any  p, 

and  the  pair  (P*,p*)  is  said  to  be  in  equilibrium,  or  there  exists  (P*,p*)  for  any  given 
E>0  such  that 

f(p*,r)>v-e 


and 


f(b,p*)<v+e 
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for  any  (b4‘)e  BxR  and  the  strategy  pair  0*,p*)  is  said  to  be  in  e-equilibrium,  or  an 
e-equilibrium  pair  of  the  game.  The  strategies  P*  and  p*  are  called  an  e-optimal 
strategy  for  Blue  and  an  e-optimal  strategy  for  Red,  respectively. 

In  theory  and  practice,  the  following  problems  arise 

1.  For  a  finite  or  infinite  game  without  a  saddle  point,  how  should  the  players 
make  their  decisions?  Here,  we  suppose  that  the  game  is  played  once.  Thus  mixed 
strategies  are  not  considered. 

2.  For  an  infinite  game  without  a  value  v,  i.e.,  vi<V2,  how  should  the 
players  make  their  decisions? 

To  answer  the  first  question,  it  is  necessary  to  state  the  assumption  clearly. 
In  fact,  the  model  satisfies  the  following  assumptions. 

(a)  There  are  two  players  and  the  playoff  function  is  zero  sum. 

(b)  The  two  players  both  are  proficient  in  playing  the  game.  Namely,  they 
know  what  decisions  to  make  if  they  know  what  the  opponent  does. 

(c)  The  players  are  both  conservatives.  Namely,  they  always  suppose  that 
their  opponents  are  so  smart  that  they  know  beforehand  what  decisions  their 
opponents  will  make. 

(d)  The  players  have  played  the  game  one  or  more  times. 

Consider  finite  games,at  first.  In  such  a  case,  there  exists  a  strategy  pair 
(b*j*)  and  two  strategies  ro  and  bq  such  that 

f(b*«)=“g*  ™  f(bj) 

and 

f(boJ*)=  "j"  “g*  f(bj) 

^  (lb) 

for  all  (b,  r)eBxR.  If  (b*,ro)=(bo,r*),  i.e.,  b*=bo,  r*=ro,  then  the  pair 
(b*,r*)(=(b*,ro)=(bo,r’''))  is  a  saddle  point  of  the  game.  It  is  an  equilibrium.  If 
(b*Jo)  ^(bo,r*),  we  treat  the  game  as  follows. 

First,  suppose  the  pair  (b*,r*)  satisfying  conditions  (la  and  lb)  is  unique. 
Then  the  game  has  an  equilibrium  "to  some  degree"  at  the  pure  strategy  pair  (b*,r*) 
because 

f(b*,r)^  f(b,r)  f(b,r*)^  f(b,r) 
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for  all  b  j.  In  other  words,  no  player  has  a  positive  reason  for  changing  his  strategy 
because  any  change  could  cause  an  appropriate  change  by  the  opponent  so  that  the 
player  could  get  less  or  lobw  more.  We  call  such  a  pure  strategy  pair  a  "quasi-saddle 
point"  of  the  game.  If  the  game  is  played  once  or  a  few  times,  the  saddle  point  or 
quasi-equilibrium  pair  should  be  the  optimal  or  quasi-opdmal  strategy  pair  for  players 
Blue  and  Red.  If  the  game  is  played  a  large  number  times,  the  players  Blue  and  Red 
should  choose  the  saddle  point  or,  generally,  the  optimal  mixed  strategies. 

Second,  suppose  the  strategy  b*  satisfying  (la)  is  unique,  and  there  are 
several  pure  strategies  ri,  T2,  .../k  satisfying  (lb)  for  player  Red.  Select  a  strategy  r* 
from  ri,r2,...,rk  such  that 

f(bV)=  f(b*,ri). 

Then  the  strategy  pair  (b*,r*)  is  said  to  be  a  "quasi-saddle  point"  of  the  game. 

Similarly,  if  there  are  several  pure  strategies  bi,b2,...,bk  satisfying(la)  for 
player  Blue,  and  the  strategy  r*  satisfying  (lb)  is  unique,  then  the  strategy  pair 
(b*,r*)  is  said  to  be  a  "quasi-saddle  point",  where  b*  is  one  of  bi,b2,...,bk  such  that 

f(b’j*)=  f(bij‘). 

Third,  suppose  there  are  k  strategies  bi,b2,....bk  satisfying  (la)  and  1 
strategies  ri,r2,...,ri  satisfying  (lb).  Then  Blue  should  choose  pure  strategies  from 
only  bi,b2,...,bk,  Red  from  only  ri  J2.— Thus,  the  original  game  is  reduced  to  a 
new  one  which  is  simpler  than  the  original  generally.  Repeating  the  reducing 
process,  the  original  game  might  be  reduced  to  one  which  either  is  the  case  discussed 
above,  i.e.,  there  is  single  b  (or  r)  satisfying  (la)  (or(lb)),  or  cannot  be  reduced 
further. 

For  example,  a  game  with  payoff  matrix 

cannot  be  reduced  as  above.  Call  it  an  irreducible  game. 

In  the  later  case.  Blue  should  choose  any  one  of  bi,b2,...,bk  as  an  "optimal" 
strategy,  and  Red  should  choose  any  one  of  ri,r2,...,ri  as  an  "optimal"  strategy, 
where  {bi,b2,...,bk;ri,r2,...4l}  cannot  be  reduced  further.  Such  a  pure  strategy  pair 
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(b*,r*)  is  said  to  be  a  "pseudo-saddle  point"  of  the  original  game  if 
bs  { bi,b2>>*>>bi(} I  rs  {ri,r2f.tri}. 

For  infinite  games,  assume  that  there  exists  a  strategy  pair  (b*/*)  such  that 
f(b*,r)S^^f(b4-)  and  fCbj*)^  f(b,r) 

for  all  b,r.  Then  we  can  define  concepts  of  "quasi-saddle  point"  and  "pseudo- saddle 
point"  similarly. 

Now,  let  us  consider  problem  2  proposed  at  the  beginning  of  this  section. 
Besides  the  hypotheses  ((a),(b),(c),(d)),  we  add  one  more  as  follows. 

(e)  VI  can  be  reached  by  a  mixed  strategy  p*  for  player  Blue;  V2  can  be 
reached  by  a  mixed  strategy  p*  for  player  Red  in  any  game  concerned  in  the 
discussed  process.  Namely,  there  exist  P*  and  p*  such  that 


f(P*,p)Sinff(p,p)  for  any  P  for  Blue 


(2a) 


and 


f(P,p*)^  f(P,p)  for  any  p  for  Red 

P  (2|,) 

Then,when  the  pair  (P*,p*)  satisfying  (2a)  and  (2b)  is  unique,  the  pair 
(P*,p*)  is  said  to  be  in  "quasi-mixed  equilibrium",  or  a  "quasi-mixed  equilibrium" 
pair  of  the  game. 

When  the  strategy  P*  satisfying  (2a)  is  unique  and  there  exists  a  set  Ri  of  p 
satisfying  (25),  there  are  two  cases.  First,  there  exists  p*  such  that 

f(p*,p*)=inff(P*,p). 

Then  pair  (P*,p*)  is  then  in  quasi-mixed  equilibrium.  Second,  there  is  no  such  p*. 
Then  there  exists  p*  such  that 

f(P*,p*)^inff(P*,p)+e 

for  any  given  e>0.  (P*,p*)  is  said  to  be  in  "quasi-mixed  e-eauilibrium". 

Similarly,  when  there  exists  a  set  Bi  of  b  satisfying  (2a)  and  the  strategy  p-" 
satisfying  (2b)  is  unique,  there  are  two  cases.  First,  there  exists  P*  such  that 


f(p*,p*)=  sup 
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The  pair  (p*,p*)  is  then  in  "quasi-mixed  equilibrium".  Second,  there  is  no  such  P*. 
Then  there  exists  P*  such  that 

f(p*,p*)^  sup  f(p,p*).e 

for  any  given  e>0.  (P*,p*)  is  said  to  be  in  "quasi-mixed  e-equilibrium". 

If  there  exists  a  set  Bi  of  strategies  P  satisfying  (2a)  and  a  set  R1  of  strategies 
r  satisfying  (2b).  Then  Blue  should  choose  a  mixed  strategy  only  from  B  i.  Red  only 
from  Ri.  The  original  game  is  thereby  reduced  to  a  new  one  which  is  much  simpler 
than  the  original  one  generally.  Repeating  the  reducing  process  on  the  reduced 
games,  the  original  game  either  can  be  reduced  to  one  which  is  the  case  discussed  as 
above  or  can  be  reduced  to  an  irreducible  game  in  a  frnite  number  of  steps,  or  can  be 
reduced  without  stopping  i.e.,  the  reduced  game  which  is  gotten  in  any  finite  number 
of  steps  can  be  reduced  further.  In  the  latter  two  cases,  a  pair  (P*,p*)  is  said  to  be  in 
"pseudo-mixed  equilibrium"  or  an  pseudo-mixed  equilibrium  pair  of  the  game,  where 
(P*,p*)eBixRi,  and  BixRj  either  is  irreducible  or  can  be  reduced  without 
stopping. 

Proposition  1*.  for  the  MDCPG  in  section  2,  vi<v2.  In  this  case,  there  is  a 
single  P*  and  a  single  p*  satisfying  (2a)  and  (2b),respectively,  where  P*,p* 
correspond  p=l/2  and  q=l/2.  Thus,  (p,q)=(l/2,l/2)  is  a  quasi-mixed  equilibrium 
pair  of  the  game  DPCG. 

Finally,  it  should  be  pointed  out  that  when  there  exists  a  saddle  point  in 
problem  1  ( the  value  v(vi=v2)  in  problem  2),  hypothesis  (c)  can  be  replaced  by  the 
following  hypothesis. 

(c')  One  of  the  players  is  conservative,  and  the  other  is  astute,  i.e.,  he  knows 
that  his  opponent  is  conservative. 

However,  in  the  case,  the  theory  cteduced  above  about  quasi-  equilibrium  and 
pseudo-equilibrium  does  not  apply. 
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